Assessment of Tweet Credibility with LDA Features

Authors

  • Jun Ito
  • Jing Song
  • Hiroyuki Toda
  • Yoshimasa Koike
  • Satoshi Oyama
Abstract

With the rapid development of social networking services (SNS) such as Twitter, which let users exchange short messages online, people can get information not only from traditional news media but also from the mass of SNS users. However, SNS users sometimes propagate spurious or misleading information, so an effective way to assess the credibility of information automatically is required. In this paper, we propose methods for assessing information credibility on Twitter that use “tweet topic” and “user topic” features derived from the Latent Dirichlet Allocation (LDA) model. We collected two thousand tweets, each labeled by seven annotators, and designed effective features for our classifier on the basis of data analysis results. In our experiment, the proposed methods improved Area Under Curve (AUC) scores by 3% over existing methods, leading us to conclude that topical features are an effective way to assess tweet credibility.
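The AUC metric mentioned in the abstract can be illustrated with a minimal sketch. It assumes the standard pairwise definition of AUC: the probability that a randomly chosen credible tweet receives a higher score than a randomly chosen non-credible one, with ties counted as one half. The function name `auc`, the labels, and both score lists are hypothetical toy values chosen for illustration; they are not the paper's data, classifier, or results.

```python
def auc(labels, scores):
    """Pairwise AUC: chance that a random positive outscores a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy data (assumption): 1 = credible, 0 = not credible.
labels = [1, 1, 1, 0, 0, 0]
# Made-up scores from a hypothetical baseline classifier and from a
# hypothetical classifier that also uses topic features.
baseline_scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]
with_topic_scores = [0.9, 0.8, 0.6, 0.7, 0.3, 0.2]

baseline_auc = auc(labels, baseline_scores)
topic_auc = auc(labels, with_topic_scores)
print(f"baseline AUC: {baseline_auc:.3f}, with topic features: {topic_auc:.3f}")
```

Because AUC depends only on how the scores rank the tweets, it lets classifiers with differently scaled outputs be compared without fixing a decision threshold.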

Similar resources

CAT: Credibility Analysis of Arabic Content on Twitter

Data generated on Twitter has become a rich source for various data mining tasks. Those data analysis tasks that are dependent on the tweet semantics, such as sentiment analysis, emotion mining, and rumor detection among others, suffer considerably if the tweet is not credible, not real, or spam. In this paper, we perform an extensive analysis on credibility of Arabic content on Twitter. We als...

ClaimFinder: A Framework for Identifying Claims in Microblogs

Twitter is a microblogging platform that allows users to post public short messages. Posts shared by users pertaining to real-world events or themes can provide a rich “on-the-ground” live update of the events for the benefit of everyone. Unfortunately, the posted information may not all be credible, and rumours can spread over this platform. Existing credibility assessment work has focused on i...

A Hybrid Approach for Multimedia Use Verification

Social networks enable multimedia sharing between users worldwide; however, no automatic mechanism has been implemented to verify multimedia use. This is known to be a highly challenging problem because of the variety of media types and the huge amount of information they convey. As a participating team of MediaEval 2016, we propose a hybrid approach for detecting misused multimedia on Twi...

Automatic screening of guilty persons by analyzing the separability of features of skin conductance and photoplethysmography signals

Credibility assessment screening with a small system that delivers optimal results in minimal time is a basic need at critical gates. Therefore, the aim of this research is the automatic detection of stress in guilty persons through skin conductance response and photoplethysmograph signals, which are obtained with convenient, easy-to-use sensors. In this paper, a database with an interview protocol (including ...

User Perception of Information Credibility of News on Twitter

In this paper, we examine user perception of credibility for news-related tweets. We conduct a user study on a crowd-sourcing platform to judge the credibility of such tweets. By analysing user judgments and comments, we find that eight features, including some that cannot be automatically identified from tweets, are perceived by users as important for judging information credibility. Moreover...

Publication date: 2015